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24 ABSTRACT 

25 Based on small-RNA-Seq analysis data, we developed highly sensitive and specific real-time 

26 RT-PCR assays with locked nucleic acid probes targeting the abundantly expressed leader 

27 sequences of MERS-CoV and other human coronaviruses. Analytical and clinical evaluations 

28 showed their non-inferiority to a commercial multiplex PCR test for the detection of these 

29 coronaviruses. 
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30 Coronaviruses (CoVs) have repeatedly crossed species barriers and some have emerged as 

31 important human pathogens (1,2). HCoV-229E, HCoV-OC43, HCoV-NL63, and HCoV-HKUl 

32 predominantly cause mild upper respiratory tract infections, while severe acute respiratory 

33 syndrome CoV (SARS-CoV) and Middle East respiratory syndrome CoV (MERS-CoV) 

34 frequently cause severe pneumonia with extrapulmonary manifestations (3-6). Highly sensitive 

35 and specific laboratory diagnostic tests are essential for the control of emerging CoV outbreaks 

36 (7). The gold standard of laboratory diagnosis of CoV infection is the isolation of infectious virus 

37 from respiratory tract and/or other clinical specimens. However, most CoVs are either difficult or 

38 dangerous to culture in cell lines (8,9). The need of convalescent samples and potential false- 

39 positive results due to cross-reactivity with other CoVs limit the use of serum antibody detection 

40 assays in the acute setting (10). The overall sensitivity of antigen detection assays is inferior to 

41 that of molecular assays such as reverse transcription (RT)-PCR (11,12). With the increasing 

42 availability of molecular diagnostic facilities and expertise in clinical microbiology laboratories 

43 worldwide, RT-PCR has become the test of choice for diagnosing CoV infections (7,13-15). 

44 Traditionally, the preferred targets of RT-PCR assays are genes that are conserved and/or 

45 abundantly expressed from the viral genome (16). For CoVs, the most commonly employed 

46 targets include the structural nucleocapsid (N) and spike (S) genes, and the non-structural RNA- 

47 dependent RNA polymerase (RdRp) and replicase ORFla/b genes (4,7). Recently, other unique 

48 non-coding genome regions not present in related CoVs have also been utilized to develop RT- 

49 PCR for the emerging MERS-CoV (7,13-15). The World Health Organization (WHO) 

50 recommends using the upE assay (regions upstream of the envelope [E] gene) for laboratory 

51 screening of suspected MERS cases, followed by confirmation with either the ORFla or ORFlb 

52 assays (7). Notably, a number of single nucleotide mismatches at different positions included in 
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53 the upE assay forward primer and probe have been detected in recent strains of MERS-CoV and 

54 may affect the sensitivity of this assay (17). We hypothesize that additional gene targets may be 

55 suitable for RT-PCR design for CoVs and would increase the options of molecular diagnosis for 

56 circulating and emerging CoV infections. In this study, we designed and evaluated novel real- 

57 time RT-PCR assays with locked nucleic acid (LNA) probes for clinically important Co Vs based 

58 on the identification of the abundantly expressed leader sequence in the 5’-untranslated region 

59 (UTR) in small-RNA-Seq data analysis. 

60 We included MERS-CoV (strain HCoV-EMC/2012, passage 8, provided by Ron 

61 Foucheir, Erasmus Medical Center), HCoV-229E, HCoV-OC43, HCoV-NL63, and HCoV- 

62 HKU1 in the study. SARS-CoV was not included as there has not been any human case since 

63 2005. The MERS-CoV isolate was amplified by one additional passage in Vero cells to make 

64 working stocks of the virus (5.62><10 5 50% tissue culture infective doses [TCID 5 o]/ml) as 

65 previously described (18). All experimental protocols involving live MERS-CoV followed the 

66 approved standard operating procedures of the biosafety level 3 facility at Department of 

67 Microbiology, The University of Hong Kong, as previously described (19). High-titer stocks of 

68 HCoV-229E, HCoV-OC43, and other respiratory viruses were prepared and their TCID 50 values 

69 were determined using standard methods and as previously described (20,21-23). Attempts to 

70 culture HCoV-NL63 and HCoV-HKUl were unsuccessful because of their difficulty to grow in 

71 cell lines available in our laboratories. Virus-positive clinical specimens (n=14) and laboratory 

72 strains (n=13) used for evaluating the novel assays’ cross-reactivities with other respiratory 

73 viruses were obtained from archived nasopharyngeal aspirates at the clinical microbiology 

74 laboratory at Queen Mary Hospital, Hong Kong. Total nucleic acid extractions of clinical 

75 specimens and laboratory cell culture with virus strains were performed on 200 pi of sample 
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76 using EZ1 vims Mini Kit v2.0 (QIAgen) according to the manufacturer’s instructions. The 

77 elution volume was 60 |il. Extracts were stored at -70°C or below until use. Total nucleic acid 

78 extracts of ResPlex-II-HCoV-positive (n=49) and -negative (n=180) respiratory clinical 

79 specimens prepared by using the QIAamp MinElute Vims Spin Kit were provided by the Hong 

80 Kong Sanatorium and Hospital. A total of 229 fresh or frozen nasopharyngeal aspirates (NPAs) 

81 collected between 1 January 2012 and 31 October 2014 from 229 pediatric and adult patients, 

82 including 128 males and 101 females, aged 1 to 97 years, who were managed in Queen Mary 

83 Hospital and Hong Kong Sanatorium and Hospital for upper and/or lower respiratory symptoms 

84 were included in the study. 

85 The most abundantly expressed sequence in the MERS-CoV genome was determined by 

86 small-RNA-Seq data analysis (Supplementary Information). Approximately 2.6% of the trimmed 

87 reads could be mapped onto the MERS-CoV genome. Among the mapped sequences, the 

88 mapping analysis revealed that most of these small RNA sequence reads, accounting for >6,000 

89 sequences (6.3%), matched the 67-nucleotide leader sequence at the 5' terminus of the genome 

90 (Fig. 1). In contrast, the other peaks at the ORFla, S, and N gene regions accounted for <3.0% of 

91 the mapped small RNA sequence reads. Our mapping analysis also showed that the percentages 

92 of mapped small RNA sequence reads at the gene regions targeted by the previously described 

93 upE, ORFla, ORFlb, N2, N3, NSeq, and RdRpSeq assays, which had longer sequences than the 

94 67-nucleotide MERS-CoV leader sequence, were only 0.2%, 0.1%, <0.1%, 0.3%, 0.1%, 2.8%, 

95 and 0.2% respectively (13,14,24). HCoV-229E, HCoV-OC43, HCoV-NL63, and HCoV-HKUl 

96 similarly possess 70 to 72-nucleotide leader sequences at the same region in their respective 

97 genomes (Fig. 1) (25-28). Although leader sequences of around 60 to 90 nucleotides in length 

98 are found at the 5’-UTR upstream to the transcription regulatory sequence in the genomes and at 
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99 the subgenomic RNAs of all CoVs, the function of these leader sequences remains poorly 

100 understood (29-31). In view of the abundance of the leader sequences and since infected cells are 

101 known to contain large amounts of viral subgenomic RNA at which the leader sequences are 

102 abundantly found (32,33), we hypothesized that the leader sequence might be a valuable 

103 diagnostic target not only for MERS-CoV but also for other currently circulating HCoVs (Fig. 

104 1). 

105 To overcome the relatively short length of the leader sequences, we employed LNA 

106 probes (Exiqon, Copenhagen, Denmark) to develop novel real-time RT-PCR assays for these 

107 human pathogenic CoVs. LNA is a nucleic acid analogue with an extra bridge connecting the 2' 

108 oxygen and 4' carbon that has exceptionally high hybridization affinity towards complementary 

109 DNA and RNA and efficient mismatch discrimination (34). These properties are associated with 

110 an increased melting temperature of the oligonucleotides, which allows the application of shorter 

111 probes when LNA rather than DNA nucleotides are used in the nucleic acid amplification assays 

112 (34). In recent years, LNA probes have been increasingly used in the design of real-time PCR 

113 assays for other respiratory infections such as those caused by avian influenza A/H5N1 virus, 

114 rhinovirus, enteroviruses, respiratory syncytial virus, and Mycoplasma pneumoniae (35-38). 

115 Using LNA probes, we developed five novel real-time RT-PCR assays (named MERS-CoV-LS, 

116 HCoV-229E-LS, HCoV-OC43-LS, HCoV-NL63-LS, and HCoV-HKUl-LS) targeting the short 

117 leader sequences of these CoVs (Table 1). 

118 The analytical sensitivities and specificities of the assays were excellent. The limits of 

119 detection with in vitro RNA transcripts for MERS-CoV-LS, HCoV-229E-LS, and HCoV-OC43- 

120 LS were 10 RNA copies/reaction and those for HCoV-NL63-LS and HCoV-HKUl-LS were 5 

121 RNA copies/reaction. Linear amplification was achieved over an 8-log dynamic range, from 10 1 - 
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122 10 8 RNA copies/reaction for all five assays, with calculated linear correlation coefficients (R 2 ) of 

123 0.99-1.00 and amplification efficiency values of 1.93-2.27. The limits of detection with viral 

124 RNA were approximately 5.62xl0" 2 TCIDso/ml, 5.00xl0" 2 TCIDso/ml, and 3.16xl0’ 3 TCIDso/ml 

125 for MERS-CoV-LS, HCoV-229E-LS, and HCoV-OC43-LS respectively (Supplementary Tables 

126 1 and 2). The limit of detection for the MERS-CoV-LS assay was about one log TCIDso/ml 

127 higher than that for the MERS-CoV-upE assay in parallel runs and was comparable with those 

128 for the other assays currently recommended for screening and/or confirmation of MERS by the 

129 WHO, including the ORFla, ORFlb, RdRpSeq, and NSeq assays (Supplementary Table 1) 

130 (7,14). Comparatively, the ORFlb assay for MERS-CoV has the least optimal limit of detection 

131 of 64 RNA copies/reaction (13,14). Our assays showed no cross-reactivity among the individual 

132 CoVs and with other common respiratory viruses including adenovirus, influenza A and B 

133 viruses, parainfluenza virus types 1 to 4, rhinovirus, respiratory syncytial vims, and human 

134 metapneumovirus (Supplementary Table 3). 

135 Additionally, we assessed the diagnostic performance of our assays and compared it with 

136 ResPlex-II in in-use evaluation using 229 NPAs. ResPlex-II is a commercially available 

137 multiplex PCR assay which detects 18 respiratory viruses including HCoV-229E, HCoV-OC43, 

138 HCoV-NL63, and HCoV-HKUl in a single run. It is commonly employed for laboratory 

139 diagnosis of viral respiratory tract infections in many clinical laboratories worldwide (39,40). 

140 Forty-nine NPAs which were tested positive for HCoVs by ResPlex-II and another 180 NPAs 

141 which were tested negative for respiratory viruses by ResPlex-II (Table 2) were tested in an 

142 operator-blinded manner. Our assays tested positive for the corresponding HCoVs with viral 

143 loads of 1.37xl0 1 -3.86xl0 8 RNA copies/reaction in all 49 (100%) ResPlex-II-HCoV-positive 

144 NPAs (Table 2) {p = 1.00; Fisher’s exact test). Moreover, our assays detected HCoVs in an 
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145 additional 4/180 (2.2%) NPAs which were initially tested negative by ResPlex-II, probably 

146 because of the low viral loads of 2.29xl0'-2.40xl0 2 RNA copies/reaction (ji = 0.12; Fisher’s 

147 exact test). Sequencing analysis and two-step confirmatory real-time RT-PCR assays using 

148 specific primers targeting the N gene of HCoV-OC43 and FlCoV-NL63 (Supplementary 

149 Information) confirmed that the results of these four ResPlex-II-FlCoV-negative specimens 

150 concurred with their CoV real-time RT-PCR assay results (two were positive for FlCoV-OC43 

151 and two were positive for FlCoV-NL63). Overall, these results suggest that our assays are highly 

152 sensitive and specific, and not inferior to ResPlex-II for the detection of FICoVs in vitro and in 

153 clinical samples. It is important to note that while ResPlex-II and other multiplex PCR assays 

154 have the advantage of being able to detect multiple viruses simultaneously, the sensitivity may be 

155 <50% and inferior to monoplex PCR assays for FICoVs and other respiratory viruses such as 

156 influenza A viruses (39,41). This relatively poorer sensitivity would especially limit the 

157 application of these multiplex PCR assays for the detection of future emerging CoVs and avian 

158 influenza A viruses which are potential pandemic agents that have significant public health 

159 impact if a case was misdiagnosed. 

160 Our study has demonstrated the previously unknown diagnostic value of the CoV leader 

161 sequence and the usefulness of small-RNA-Seq data analysis in the selection of optimal gene 

162 targets for the development of molecular diagnostic assays. The application of LNA probes 

163 allowed the use of relatively short sequences such as the leader sequence of CoV genomes as a 

164 diagnostic target in RT-PCR assays. The same approach may be applied to identify and design 

165 real-time RT-PCR assays for other emerging viruses including novel CoVs that are likely to 

166 emerge in the future, once their genomic data become available. As for any other gene targets 

167 used in RT-PCR assays, particular attention should be paid to the presence of polymorphisms in 
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168 the leader sequences, which may affect the sensitivity of the assays. The novel CoV real-time 

169 RT-PCR assays with LNA probes described in the present study should be further evaluated in 

170 large-scale in-field evaluations. Development of these assays into multiplex assays with 

171 comparable sensitivity and specificity and additional detection of other novel or re-emerging 

172 CoVs may further enhance their clinical utility. 
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187 FIGURE LEGENDS 

188 Fig. 1 Schematic diagram of the MERS-CoV genome, with the leader sequence at the 5’- 

189 untranslated region enlarged to illustrate the abundance of the small RNA sequences. The 

190 percentages of mapped small RNA sequence reads at the leader sequence, ORFla, S, and N gene 

191 regions are quantified and shown. Leader sequences of 70 to 72 nucleotides in length are also 

192 present in other human coronaviruses (HCoV-229E, HCoV-OC43, HCoV-NL63, and HCoV- 

193 HKU1). 
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194 TABLE 1 Primer and probe sequences of CoV real-time RT-PCR assays with LNA probes in the present study 0 

195 


Assay 

Genome target 

Genome location 

Primer/probe 

Sequence (5’ to 3’) 

GenBank 
accession no. 

MERS-CoV-LS 

Leader sequence 

14-32 

Forward 

AGCTTGGCTATCTCACTTC 

JX869059.2 



47-69 

Reverse 

AGTTCGTTAAAATCAAAGTTCTG 




34-47 

Probe 

C+CT+CGT+T+CT+CT+TGC 


HCoV-229E-LS 

Leader sequence 

20-41 

Forward 

CTACAGATAGAAAAGTTGCTTT 

NC 002645.1 



57-75 

Reverse 

ggTCGTTTAGTT GAGAAAAGT 




44-59 

Probe 

AGACT+T+TG+TG+TCT+A+CT 


HCoV-OC43-LS 

Leader sequence 

17-28 

Forward 

aaaCGTGCGTGCATC 

NC 005147.1 



43-66 

Reverse 

AGATTACAAAAAGATCTAACAAGA 




32-48 

Probe 

C+TTCA+CTG+ATCT+C+T+TGT 


HCoV-NL63-LS 

Leader sequence 

23-46 

Forward 

ggAGATAGAGAATTTTCTTATTTAGA 

NC 005831.2 



60-77 

Reverse 

ggTTTCGTTTAGTTGAGAAG 




50-66 

Probe 

TGTGT+C+TAC+T+C+TTCT+CA 


HCoV-HKUl-LS 

Leader sequence 

21-37 

Forward 

CGTACCGTCTATCAGCT 

NC 006577.2 



48-71 

Reverse 

GTTTAGATTTAAT GAGATCT G AC A 




39-52 

Probe 

ACGA+T+CT+C+TTG+T+CA 



196 Abbreviations: HCoV, human coronavirus; MERS-CoV, Middle East respiratory syndrome coronavirus; UTR, untranslated region. 

197 

198 a Probes were labeled at the 5' end with the reporter molecule 6-carboxyfluorescein (6-FAM) and at the 3' end with Iowa Black FQ 

199 (Integrated DNA Technologies, Inc). Lowercase letters represent the additional bases added which is not from the original genome 

200 sequence. The letters following “+” represent LNA bases which are modified with an extra bridge connecting the 2' oxygen and 4' 

201 carbon. The bridge "locks" the ribose in the 3'-endo (North) conformations and significantly increases the hybridization properties of 

202 the probe. 
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203 TABLE 2 Comparison between CoV real-time RT-PCR assays with LNA probes and ResPlex-II for the detection of CoVs in 

204 nasopharyngeal aspirates 

205 




Resplex-Il-HCoV-positive NPAs" 
(n = 49) 


Resplex-II-HCoV-negative NPAs a 
(n = 180) 

Assay 

HCoV-229E 

HCoV-OC43 

HCoV-NL63 

HCoV-HKUl 


MERS-CoV-LS 

0/1 

0/17 

0/27 

0/4 

0/180 

HCoV-229E-LS 

1/1 

(1.64 X 10 4 ) 

0/17 

0/27 

0/4 

0/180 

HCoV-OC43-LS 

0/1 

17/17 

(1.37 x 10 1 - 
3.86 x 10 s ) 

0/27 

0/4 

2/180 

(1.84 X 10 2 - 2.40 X 10 2 ) 

HCoV-NL63-LS 

0/1 

0/17 

27/27 

(9.20 x 10 2 - 
3.47 x 10 7 ) 

0/4 

2/180 

(2.29 x 10 1 -9.34 x 10 1 ) 

HCoV-HKUl-LS 

0/1 

0/17 

0/27 

4/4 

(1.94 X 10 3 - 
4.42 X 10 5 ) 

0/180 


206 Abbreviations: NPAs, nasopharyngeal aspirates 

207 “ Values in brackets represent the ranges of quantitative results of positive samples in RNA copies/reaction. 
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MERS-CoV GAUUUAAGUGAAUAGCUUGGCUAUCUCACUUCCCCUCGUUCtJCUUGCAGAACUUUGAUUUUAACGAA 

HCoV-OC4 3 GAUUGUGAGCGAUUUGCGUGCGUGCAUCCCGCUUCACUGAUCUCUUGUUAGAUCUUUUUGUAAUCUAAAC 
HCOV-NL63 CUUAAAGAAOUUUUCUAUCUAUAGAUAGAGAAOUUUCUUAUUUAGACUUUGUGUCUACUCUUCUCAACUAAA 
HCoV-HKUl GAGUUUGAGCGAUUGACGUUCGUACCGUCUAUCAGCUUACGAUCUCUUGUCAGAUCUCAUUAAAUCUAAAC 

























